Fast Action Retrieval from Videos via Feature Disaggregation

Authors

  • Jie Qin
  • Li Liu
  • Mengyang Yu
  • Yunhong Wang
  • Ling Shao
Abstract

Learning-based hashing methods, which aim to learn similarity-preserving binary codes for efficient nearest neighbor search, have been actively studied recently. The majority of these approaches address hashing problems for image collections. However, because of the extra temporal information, videos are usually represented by much higher-dimensional features (thousands of dimensions or more) than images, which causes high computational complexity for conventional hashing schemes. In this paper, we propose a simple and efficient hashing scheme for high-dimensional video data. This method, called Disaggregation Hashing, exploits the correlations among different feature dimensions. An intuitive feature disaggregation method is first proposed, followed by a novel hashing algorithm based on different feature clusters. We demonstrate the efficiency and effectiveness of our method through theoretical analysis and by exploring its application to action retrieval from video databases. Extensive experiments show the superiority of our binary coding scheme over state-of-the-art hashing methods.
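The abstract does not spell out the algorithm, so the following is only a rough sketch of the general idea it describes: group correlated feature dimensions, then derive binary codes per group and search by Hamming distance. Every choice here is an assumption made for illustration and is not the authors' Disaggregation Hashing: the function names (`disaggregate_features`, `fit_group_hash`, `encode`, `hamming_rank`) are hypothetical, the grouping is a plain k-means over correlation profiles, and each group contributes one bit from a random projection thresholded at its median.

```python
import numpy as np


def disaggregate_features(X, n_groups, n_iter=20, seed=0):
    """Assign each feature dimension to a group of correlated dimensions.

    Illustrative stand-in only: k-means over the rows of the absolute
    correlation matrix, so dimensions with similar correlation profiles
    end up in the same group. Requires n_groups <= number of dimensions.
    """
    rng = np.random.default_rng(seed)
    corr = np.nan_to_num(np.abs(np.corrcoef(X, rowvar=False)))  # (d, d)
    centers = corr[rng.choice(corr.shape[0], n_groups, replace=False)].copy()
    labels = np.zeros(corr.shape[0], dtype=int)
    for _ in range(n_iter):
        # Squared Euclidean distances from every profile to every center.
        dists = (
            (corr ** 2).sum(axis=1)[:, None]
            - 2.0 * corr @ centers.T
            + (centers ** 2).sum(axis=1)[None, :]
        )
        labels = dists.argmin(axis=1)
        for g in range(n_groups):
            members = labels == g
            if members.any():
                centers[g] = corr[members].mean(axis=0)
    return labels


def fit_group_hash(X, labels, seed=0):
    """Fit one hash bit per non-empty group (assumed scheme, not the paper's).

    Each bit is a random projection of the group's dimensions, thresholded
    at the median of the training projections so the bit is roughly balanced.
    """
    rng = np.random.default_rng(seed)
    model = []
    for g in np.unique(labels):
        idx = np.flatnonzero(labels == g)
        w = rng.standard_normal(idx.size)
        threshold = np.median(X[:, idx] @ w)
        model.append((idx, w, threshold))
    return model


def encode(X, model):
    """Map real-valued features to binary codes of shape (n_samples, n_bits)."""
    bits = [(X[:, idx] @ w) > threshold for idx, w, threshold in model]
    return np.stack(bits, axis=1).astype(np.uint8)


def hamming_rank(query_code, db_codes):
    """Return database indices sorted by Hamming distance to the query code."""
    dists = np.count_nonzero(db_codes != query_code, axis=1)
    return np.argsort(dists, kind="stable")
```

In use, one would call `disaggregate_features` and `fit_group_hash` once on training descriptors, `encode` the whole database, and then rank database items for each encoded query with `hamming_rank`. Because each projection only touches the dimensions of its own group, the cost of encoding grows with the feature dimensionality rather than with dimensionality times code length, which is the kind of saving the abstract alludes to for high-dimensional video features.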

Similar resources

An Enhanced Content-Based Video Retrieval System Based on Query Clip (T. N. Shanmugam and Priya Rajendran)

Content-based search and retrieval of video data has become a challenging and important issue. Video contains several types of audio and visual information which are difficult to extract, combine or trade-off in common video information retrieval. This research work is the enhanced version of our previous research with texture feature extraction. In this paper, we address the specific aspect of...


Comprehensive Performance Comparison of Fourier, Walsh, Haar, Sine and Cosine Transforms for Video Retrieval with Partial Coefficients of Transformed Video

The desire for better and faster retrieval techniques has always fuelled research in content-based video retrieval (CBVR). This paper presents an extended comparison of innovative CBVR techniques that use partial coefficients of transformed video frames as feature vectors, obtained with various orthogonal transforms. Here the popular transforms are considered...


Retrieving Actions in Group Contexts

We develop methods for action retrieval from surveillance video using contextual feature representations. The novelty of our proposed approach is two-fold. First, we introduce a new feature representation called the action context (AC) descriptor. The AC descriptor encodes information about not only the action of an individual person in the video, but also the behaviour of other people nearby. ...


Mining spatiotemporal video patterns towards robust action retrieval

In this paper, we present a spatiotemporal co-location video pattern mining approach with application to robust action retrieval in YouTube videos. First, we introduce an attention shift scheme to detect and partition the focused human actions from YouTube videos, which is based upon the visual saliency [13] modeling together with both the face [35] and body [32] detectors. From the segmented s...


Convolutional Architecture Exploration for Action Recognition and Image Classification

Convolutional Architecture for Fast Feature Embedding (CAFFE) [11] is a software package for the training, classifying, and feature extraction of images. The UCF Sports Action dataset is a widely used machine learning dataset that has 200 videos taken in 720x480 resolution of 9 different sporting activities: diving, golf, swinging, kicking, lifting, horseback riding, running, skateboarding, swin...


Journal title:
  • Computer Vision and Image Understanding

Volume 156  Issue 

Pages  -

Publication date 2015